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Abstract 

The scope of this paper is the presentation of a test that enables to detect 
heteroscedasticity in univariate regression model. The test is simple to com- 
pute and very general since no hypothesis is made on the regularity of the 
response function or on the normality of errors. Simulations show that our 
test fairs well with respect to other less general nonparametric tests. 
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1. Introduction 

Heteroscedasticity represents a serious problem in statistics and has to be 
taken into consideration when performing any econometric application that 
could be affected by the latter. Therefore, many statisticians have put a lot 
of effort into the elaboration of diagnostic tests enabling accurate detection 
of heteroscedasticity. In particular, a widely and commonly used test for 
heteroscedasticity is that proposed by White (1980). Note that this test does 
not presume a particular form of heteroscedasticity. 

There are several other tests for the assumption that errors are homos- 
cedastic. If we have prior knowledge that the variance is a linear function 
of explanatory variables, the Breusch-Pagan test (1979) is more powerful. 
Meanwhile, Koenker (1981) proposes a variant of the Breusch-Pagan test 
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that does not assume normally distributed errors. More recently, Luger (2010) 
proposes a simulation-based test with power to detect unspecified forms of 
heteroscedasticity and two completely nonparametric regression model tests 
are studied respectively by Dette and Munk (1998) and Li et Al. (2006). 

2. Framework 

Consider the sample data (xj, ^)i=i,..., n following the fix-designed regression 
model : 

Yi = fi(xi) + a(xi)ei, 1 < % < n (1) 

where \i and a are unknown functions supported on S. We are interested in 
testing the problem of homoscedasticity : 

Hq : 3o"q : V x E S, a 2 (x) = a 2 ,. 

We assume that the errors (ei) of the model are a sequence of unobserved 
independent random variables with continuous probability density functions 
symmetric with respect to and such that E(ef) = 1. 

Let's consider the sequence defined for alH = 1, . . . , n by : 

where /2 is an estimate of /j,. Under H , (ef) is a sequence of realisations of 
asymptotically independent random variables such that 

3 m > s.t. V i, P(e^ < m 2 ) = 1/2. 

An estimate rr? n of ml is the median of (ef )i=i,...,„. Let's consider the sequence 
in which the ith term is equal to 1 if e\ > rh n and if ej < rh n . The test 
statistic is the length of the longest run of 0's or l's of this sequence, L n . 
Under H , if \n/2\ denotes the floor of n/2 : 

Wx = 1, . . . , n, P(L n <x) = {x)/C^ , (2) 

where S]C^ 2 \x) be the number of sequences of 0's and l's of length n with 
[n/2j l's such that the length of the longest run of 0's or l's does not ex- 
ceed x. c]T^ 2 ^ is the binomial coefficient (and also the number of n-sequences 
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containing exactly \n/2\ l's). 

A recursive formula to compute Si? (x) has been presented in Aubin and 
Leoni-Aubin (prepublication). Then, the exact law of L n can be deduced un- 
der Hq. Therefore, a value of the test statistic greater than the (1 — a)th 
quantile of the deduced distribution indicates rejection of the null hypothesis 
of homoscedasticity. 



3. Results and Discussion 

3.1. Simulation study 

In this section a broad simulation study to illustrate the finite simple 
performance of the test of homoscedasticity described in the previous section 
is presented. 

To compare our procedure with others in the literature, some of the mo- 
dels simulated in this study are the same as the ones carried by other authors. 
In particular, the validity and the power of the test proposed in the present 
paper has been numerically compared with the tests of Dette and Munk 
(1998) and of Li et al. (2006) (denoted respectively in the following DM test 
and LWI test), choosing some of the regression models used in the simulation 
study shown in these studies. Artificial data are generated according to the 
regression model ([I]), with fixed and equally spaced design in the interval 
[0;1], Xi = (i — l)/(n — 1), i = l,...,n, indipendent and identically dis- 
tributed Af(0, 1) random errors, and where /i and a are given respectively 
by 

n(x) = 1 + sin(x), a(x) = 0.5e cx (model 1), 

[i(x) = l + x, cr(x) = 0.5(1 + csin(lOx)) 2 (model 2), 

(j,(x) = 1 + x, cr(x) = 0.5(1 + cx) 2 (model 3). 

Table 1 reports the sizes (for different nominal levels) and the powers of the 
three compared tests for sample sizes n = 50, 100 and c = 0, 0.5, 1 (c = 
is the case in which the homoscedasticity hypothesis is verified). Each table 
entry is obtained from 1000 independent runs. 

Since the distribution of the test statistic is discrete, we compute the sizes 
and the powers of the test for nominal levels equal to 4.1% and 9.8% in the 
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Table 1: Empirical sizes and powers of the test Hq with sample sizes 50 and 100. 



case n = 50 (and equal to 5.8% and 12.5% in the case n = 100). For DM 
and LWI tests, we consider nominal levels equal to 5% and 10% for all n. So, 
as far as these tests can be compared, simple inspection of Table 1 allows to 
deduce that the length of the longest run test is competitive with the DM 
test. The length of the longest run test fairs well, but LWI test usually gives 
slightly better powers. Nevertheless, the mean of the obtained powers for the 
considered cases is higher for the longest run test than for the LWI test. This 
comes from the fact that in the case of the model 2, especially for n = 50 
and c = 1, the length of the longest run test has a power greatly bigger than 
the other two tests. 

3.2. Conclusion 

The length of the longest run test gives good results, relaxes several as- 
sumptions made in most of other traditional tests and is, in this sense, more 
general. 
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